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ABSTRACT Water plays an important role in physiological processes, such as the body's 
thermal equilibrium, the transfer of nutrients to the intended destination through the body, 
and the lubrication of joints. In Pakistan, the existing water availability is about 79%. 
Inadequate and adequate drinking water quality is a significant public health concern. In 
the project, we explain different machine learning techniques which are used to locate exact 
bacteria in a water sample, their shape, and scale. This technology promises sufficient 
identification and division. This invention allows for early identification of bacterial water 
pollution, requires minimal labor, etc. A robotic frame will speed up the treatment period 
without human power. It will reduce water emissions dramatically. The methods available 
for bacterial detection are effective but require lengthy waiting periods for results and 
expensive and laborious equipment. Via images with PYTHON (Its libraries), this research 
aims to detect bacteria utilizing images. This system tends to be effective and efficient way 
for water quality monitoring in different sectors in Pakistan. E.g., Wastewater treatment 
plants, Power plants, Industries, RO plants, and Laboratories. 


INDEX TERMS bacteria detection, machine learning 


I. INTRODUCTION most relevant conditions that can be 


; . : : considered in water [2]. 
Water is the life requirement of humanity. [2] 


Water plays an important role in Water, and all other life, are an essential 


physiological processes, such as the body's 
thermal equilibrium, the transfer of 
nutrients to the intended destination 
through the body, and the lubrication of 
joints [1]. The water used must therefore be 
clean and contain no microorganisms 
which cause disease. The drinking and use 
of the water standard (2005) by Turkish 
Standards Institute TS 266 stipulates the 
consistency standards water shall meet. 
Microbiological parameters are one of the 


source of humanity. From living animals to 
plants and all the rest. Any of these bacteria 
are toxic to human health anywhere in our 
world. Many bacteria, such as bacteria, can 
be contained in the water bodies (such as 
Escherichia coli, Vibrio, and salmonella). 
These appear to cause sickness or 
inflammation of animals such as renal 
failure, typhoid fever. Cholera, Diarrhea. 
The identification of waterborne microbes 
thus plays a major role in regulating water 
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quality and sanitation, thus reducing the 
risks of contaminating sources of water [3]. 


The bacterium Escherichia coli is one of the 
diseases in the water. E Coli is also one of 
the species of bacteria in living organisms' 
large intestines. But water contamination is 
caused by live-animal faces which can 
survive 4-12 weeks in water according to 
ambient conditions [4]. 


In light of the causes of infections, E. coli 
bacteria are of immense value to deter 
epidemics and reduce mortality quickly and 
effectively. A broad range of methods for 
detecting ŒE. coli bacteria has been 
developed. The process of bacterial 
detection usually requires the use of agar 
media [5] since agar is ideal for micro- 
organisms growth and provides 
components for metabolic activities by 
micro-organisms. The formation, 
separation, detection, counting, and 
susceptibility tests of microorganisms, as 
well as clinical monitoring, food, water, 
and environmental controls, are the key 
objectives of this culture of growth. Studies 
are often conceived in which bacteria, such 
as manganese added zinc, are marked with 
radiant chemicals, or analysed under a 
microscope [6]. 


A, BACKGROUND 


Pakistan is a country with an abundance of 
land and water resources. However, due to 
industrialization, urbanization, and 
population growth, water has become an 
increasingly important commodity. Water 
is essential to our way of life and 
development. In every aspect of our lives, 
it has an essential role [7]. Drinking water 
can often contain impurities from a variety 
of sources, including physical, biological, 
and chemical contamination. However, 
advances in water treatment technology 
have made it possible to remove many of 
these impurities, making drinking water 
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much safer. The most harmful impurity is 
biochemical, which induces or causes 
mortality. Several impurities are transferred 
from place to place in nutrients and 
microorganisms. Water leakage happens as 
microorganisms or hazardous substances 
are either in contact with water bodies or 
are emitted or released into groundwater or 
freshwater supplies from domestic and 
industrial waste. Animal and human fecal 
contamination reveal that coli type bacteria 
are present. Bacterial development and 
dispersion are high in the rainy season due 
to runoff of rivers, lakes, and streams in 
water bodies. Poor water-borne diseases 
propagate by facilities for treatment. In 
Pakistan, drinking water and drainage 
facilities run parallel, causing leaks, and 
mixing to contribute to water quality 
deterioration [8]. 


y 


ka 


FIGURE 1. Water pollution in Pakistan 


According to a recent report, the primary 
source of water in most Pakistani cities is 
groundwater, which unfortunately includes 
many pathogenic agents like viruses, 
bacteria, and protozoa. These agents cause 
2.5 million deaths from endemic diarrhea 
every year, making it a very serious issue in 
the country [9]. 


Water contamination is a physical 
phenomenon that takes place due to 
anthropogenic activity in different water 
supplies like wetlands, groundwater, and 
waterways. The use of low water quality 
indices and spreads waterborne diseases. 
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Around 50% of diseases and 40% of deaths 
are recorded in Community health research 
in Pakistan due to poor drinking quality. 
Around 80% of residents in Khyber 
Pakhtunkhwa province (KPK) use safe 
drinking water from surface and soil 
sources. Surface water in KP is clear and 
potable, but groundwater in the south is 
blackish in colour. In deepness, too, clean 
water is sought. However, with the aid of 
routine wells, excessive groundwater 
supply at KP is pumped out. Water supplies 
are primarily surface water and springs in 
the north of KP. The consistency and 
quantity of the drinking water are very 
limited as water and the old sewage systems 
are not well treated in urban areas [10]. 


B. HISTORY 


For millennia people have stored and 
distributed water. Before, river water for 
drinking water was being used while people 
lived as collectors. When people lived for a 
long time in a single location, it was 
generally next to the river or lake[11][12]. 
Where there were no rivers or lakes in a 
city, groundwater was used for drinking 
water. This has been pumped up by wells 
[13]. The water availability was no longer 
adequate as the human population began 
increasing considerably. Drinking water 
from a separate source had to be extracted. 


Jericho collected water some 7000 years 
ago in wells that were used as sources. 
People have already begun to develop 
water transport networks for drinking 
water. Transportation was achieved 
through clear channels excavated in rocks 
or the sand. One has used hollow tubes 
afterward. Hollow palm trees were used in 
Egypt and bamboo staunches were used by 
China and Japan. Finally, one began to use 
clay, wood, and copper. People in Persia 
were searching for rivers and lakes 


underground. The water went into the wells 
of the plateau by troughs in rocks. 


The town of Mohenjo-Daro used a large 
water source around 3000 BC. Public 
bathrooms with water boiler facilities and 
baths were provided in this area. Water and 
rainwater are used very early in ancient 
Greece in the season. With the exponential 
growth of the urban population, Greece was 
compelled by a distribution network to 
store water in water wells and transport the 
water to residents. The water used was 
transported along with rainwater into the 
sewers. The water was pushed through hills 
under pressure as valleys were entered. The 
Greek was one of the first people to become 
involved in water safety. The drinking 
water was purified using aeration basins 
[14]. 


The Romans were the best water delivery 
network builders and constructors in 
history. They were providing canals, 
springs, or groundwater. The Romans 
constructed river dams that created 
reservoirs. The water from the lake was 
aerated and delivered. Because of its 
consistency, mountain water was the most 
common kind. The aqueducts were 
designed for water transport. Water was 
carried about ten miles across these 


aqueducts. Pluming of brick, rock, brass, 
iron, wood, or plumb in the city was carried 
out. Water benefits were safeguarded 
against foreign toxins. 


FIGURE 1. Roman aqueduct 
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The aqueducts were not seen any more 
following the collapse of the Roman 
Empire. Few creations took place in the 
treatment area between 500 and 1500 AD. 
Numerous settlements have been 
manifested in the Middle Ages. Wood 
plumbing has been used in these cities. 
Water from ponds, mines, or outside the 
region has been drained. Soon, the 
conditions were extremely hygienically 
dumped into the water as wastes and 
excrements. People who drink water 
became ill and died frequently. People 
began to drink water from outside of the 
city to fix this crisis, with rivers unpolluted. 
The so-called water bearers took this water 
into the city [14]. 


John Gibb established in Paisley, Scotland 
the first water source for a whole city, in 
1804, to supply water for his bleaching 
factory and the rest of the area. Filtered 
water was carried to Glasgow in three 
years. A major water treatment plant was 
built in Paris in 1806. 12 hours before the 
filtering, the water-cooled. Sand and 
charcoal were used for filters and replaced 
all six hours. James Simpson, an 
Englishman, designed sand filters in 1827 
for purging drinking water. Today's public 
health salute remains the number one [15]. 


C. PROBLEM STATEMENT 


The methods available for bacterial 
detection are effective but require lengthy 
waiting periods for results and expensive 
and laborious equipment. Via images with 
PYTHON (Its libraries), this research aims 
to detect bacteria utilizing images. 


D. LIMITATIONS OF STUDY 


e The traditional method needs time for 
the production and replication of 
microbes necessary to be observable 
with the use of the human eye. 
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e Any plate needs to recall and record 
until the colonies are clear. 


e Some evaluations include serial 
incubation, with samples moved at a 
given time from one incubator to each 
other at a specific temperature [7]. 


E. PARAMETERS 
1) ESCHERICHIA COLI 


It is a kind of bacterium that typically lives 
in the bowels. It is also present in certain 
animals' intestines. E. coli are generally 
harmless and also contribute to maintaining 
a healthy digestive tract. However, if you 
eat polluted food or drink water from 
freshwater, certain strains can cause 


diarrhea [13]. 


As much of us use E. coli to connect food 
poisoning, pneumonia and urinary tract 
infection may also result from multiple 
forms of bacteria. E. coli is causing 75% to 
95% of urinary tract infections. E. coli is a 
regular bowel resident, so she passes into 
the urinary tract. 


Some E. coli versions leave you sick with 
the toxin Shiga. The poison destroys the 
bowel lining. You can get sick from a 
particularly weak strain, O157:H7. 
Abdominal cramps, vomiting, and bloody 
diarrhea are caused. This is the major cause 
of children's acute kidney failure. It can 
even trigger life-threatening signs like: 


Bleeding 

Confusion 

Adult kidney failure 
Fever 


In a week stable people with E. coli 
generally feel fine. However, some people 
have a grave problem called hemolytic 
uremic syndrome which affects the 
kidneys. Older persons and children are 


more likely to do so [13]. 
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2) SALMONELLA 


Salmonella is the most prevalent food- 
related cause in the United States of the 
type of bacterium. It can't be used, scented, 
or tasted. These bacteria's disease is 
formally known as salmonella disease. This 
can lead to an upset of the abdomen, 
diarrhea, fever, and pain. In 4 to 7 days, 
most people will get better themselves at 
home [14][15]. 


Salmonella is a very common infection, 
especially when it comes to food poisoning. 
In fact, it is the most popular infection when 
people discuss food poisoning. Every year, 
around the world, tens of millions of cases 
are registered. You must go to the doctor in 
acute circumstances. It will rarely threaten 
people with life. Summer infections are 
more prevalent than winter infections. This 
is because salmonella grows fast when food 
is not refrigerated at higher temperatures 


[14]. 


In their intestines and feces, people and 
animals may bear salmonella. Sometimes, 
the bacteria spread via contaminated foods. 
Raw and undercooked meat, including 
chicken, turkey, duck, beef, and pork are 
common food sources of salmonella 
infection. 


e Raw vegetables and fruits. 


e Unpasteurized milk and other 
products, such as soft cheese, yogurt, 
and ice cream, can pose serious health 
risks. 


e Eggs raw or inadequate to fry. 


e Processed foods such as nuggets for 
chicken and kinds of butter. 


F. START BACTERIAL DATASET 


This is the first step in the process of 
completing this project. First of all, we have 
to extract the dataset and import it into the 


Python environment. As we have two types 
of datasets (Escherichia Coli and 
Salmonella). We will load them separately. 
After the successful importing of datasets, 
we are ready to proceed with the next step. 


G. IMAGE ACQUISITION 


Image acquisition in the processing of 
images may be generally described as 
retrieval from any source, normally a 
hardware source, such that any subsequent 
processes can be passed along. 


Il. METHODOLOGY 


Start Bacterial dataset (Escherichia 


Coli and Salmonella) 


Image Acquisition 


Augmentation (Tensor flow and 


keras) 


Image Preprocessing (Resize the 


Image, noise removal 


Enhancement of images 


Apply differentalgorithms like K- 


Means Color based Clustering, SVMand 
Genetic Algorithm for Image Segmentation 


Image is segmented into threeregions 


FIGURE 3. Project methodology 
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FIGURE 4. Escherichia Coli and Salmonella 
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FIGURE 5. Image acquiring 


The first step in the image processing 
workflow series is typically image 
acquisition, as no processing can be done 
without the image. The picture obtained is 
unprocessed and is a product of some 
hardware that was used to produce it, which 
in certain fields can be very necessary for a 
consistent basis. One of the ultimate 
objectives is to obtain an input source that 
functions under such regulated and 
calculated guidelines that the same picture 
is almost completely reproducible, where 
possible, under the same conditions such 
that anomalous variables can be easily 
found and eliminated [8][16]. 
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A. IMAGE AUGMENTATION 


In the second step, we will augment the 
given image by using the tensor flow and 
Kera. Image data increase is a process that 
can be used to artificially increase the size 
of a dataset by generating modified image 
versions in the dataset. Rainy-deep neural- 
network-based learning models on more 
data can lead to better results, and the 
increased technologies will generate image 
variations that will enhance the suit model's 
capacity for generalizing what new pictures 
have learned. In Keras' deep neural network 
research library, image data increase 
through Image Data Generator class will 


suit models [9][17]. 
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A jupyter convnet Last Checkpoint: an hour ago (autosaved) 
Fie Edit View Insert e Kemel Widgets Help Not Tru 


B+ * @ Hh Av PRu BC H code 
tor aata_oatcn, 1aneis_oatcen in Train_generator: 
print('data batch shape:', dete_batch. shape) 
print('labels batch shape:', labels_batch. shape) 


break 
data batch shape: (32, 150, 150, 3) 
labels batch shape: (32,) 


In [50]: datagen = ImageDataGenerator( 
rotation_range-40, 
width_shift_range-0.2, 
height_shift_range-0.2, 
shear_range=0.2, 
zoom_range-0.2, 
horizontal _flip-True, 
fill _mode-'nearest') 


In [51]: from tensorflow.keras.preprocessing import image 
fnames = [os.path.foin(train_ecoli_dir, fname) for 
fname in os.listdir(train_ecoli_dir)] 
img_path = fnames(3) 
img = image.losd_img(img_peth, terget_size=(150, 150)) 
x = image. img to_array(img) 

x = x.reshape((1,) + x.shape) 
1-@ 
for batch in datagen.flow(x, batch_size=1): 
plt. figure(i) 
imgplot = plt.imshow(image.array_to_img(batch[@])) 
i1 
if 1% 4 == a: 
break 
plt.show() 


FIGURE 6. Image augmentation 
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FIGURE 7. Image augmentation 1 
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FIGURE 8. Image augmentation 2 
B. IMAGE PROCESSING 


After an image is taken, the next step is 
processing it to retrieve any valuable 
information from it. Image Processing is a 
method used to carry out various operations 
on an image to improve it. It is a form of 
signal processing in which the input is an 
image, and the output can be image or 
image features. The processing of 
photographs today is one of the fast- 
growing technologies. It also forms a 
central area of study in engineering and 
informatics. 


C. ENHANCEMENT OF IMAGES 


Enhancing images is the way to improve 
accuracy and content before retrieval of 
original files. Contrast improvement, 
spatial filtering, and density slicing are 
standard practices. 


D. APPLYING DIFFERENT 
ALGORITHMS 


After the image enhancement, we will be 
applying different algorithms to the image 
like K-Means Color-based Clustering, 
SVM, and Genetic Algorithm for Image 
Segmentation. 
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E. IMAGE SEGMENTATION 


Segmenting images is the act of splitting a 
digital image into many segments (sets of 
pixels, also known as image objects). 
Segmentation attempts to make a 
representation of an image smoother and/or 
easier to analyse in something more 
significant. Segmentation of images is 
usually used for image location of features 
and borders. More specifically, image 
segmentation is the process of dividing an 
image into multiple parts, or segments. 
Each pixel in an image is assigned a label, 
and pixels with the same label share similar 
features [10][18]. 


F. FEATURE EXTRACTION 


Extraction of features is a method of 
dimensional reduction that reduces an 
initial collection of raw data to more 
controlled processing classes. A feature of 
these massive data sets is a large number of 
variables requiring intensive use of 
computational resources 


G. EXTRACTING GLCM FROM 
IMAGE 


Extracting GLCM Grey Level Co- 
occurrence Matrices features from images 
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can help to calculate things like contrast, 


correlation, energy, entropy, and 
homogeneity. In addition, extracting 
bacterial types and symptoms and 


calculating the GLCM texture feature 
values of standard deviation, homogeneity, 
contrast, smoothness, correlation, kurtosis, 
energy, entropy, mean, variance, RMS, and 
IDM can be very helpful in understanding a 
sample. 


H. APPLYING CNN’S, RCNN’S, FAST 
RCNN’S AND RPN 


Finally, we will apply the machine learning 
methods to the selected and processed 
image to get the final result as shown in the 
following figure. In this step, we will Apply 
CNN’s, RCNN’s, fast RCNN’s and RPN’s 
for classification and types of bacterial 
detection display results with a higher 
percentage. 


HI. Literature Review 
A, RELATED WORK 


1. Early detection and classification of 
live bacteria using time-lapse coherent 
imaging and deep learning 


2. An Algorithm of Machine Learning for 
Identifying Bacteria using Digital 
Holographic Microscopy 


3. Identification of Bacteria using Digital 
Image Processing 


4. Detection and Analysis of Waterborne 
Bacterial Colonies Using Image 
Processing and Smartphones 


5. Study on Method of Bacteria Image 
Recognition 


6. Deep learning approach to bacterial 
colony classification 


7. Automatic Identification of 
Tuberculosis Mycobacterium 


8. Automated Counting of Bacterial 
Colonies by Image Analysis 


9. Water Quality Assessment by Image 


Processing 
10. Classification of Bacterial 
Contamination Using Image 


Processing and Distributed Computing 


11. Study on Method of Bacteria Image 
Recognition 


12. A ML Approach to 
Unknown Bacterial Serovars 


B. WATER QUALITY ASSESSMENT 
BY IMAGE PROCESSING 


Horak et al (2015) deal with a water quality 
assessment using an image processing 
method. Their suggested method for 
measurement of the water quality uses two 
well-known biological organisms sensitive 
to water toxicity. Their names are Daphnia 
magna and Lemna minor and they are 
frequently used for a water analysis from an 
ecotoxicology point of view. They 
suggested the method for water quality 
measurement, by designing an autonomous 
camera-based inspection. In our design, 
these two organisms are continuously 
scanned in separated vessels by two 
cameras and acquired images are then 
processed autonomously. There are 
employed methods of a _ colour-space 
transformation and analysis and a motion 


analysis in an image processing stage [1 1]. 


C. CLASSIFICATION OF BACTERIAL 
CONTAMINATION USING IMAGE 
PROCESSING AND DISTRIBUTED 
COMPUTING 


Detecting 


Ahmed et al [9] study, we used computer 
clusters to speed up the feature-extraction 
process, which enables us to analyse the 
contribution of different scatter-based 
features to the overall classification 
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accuracy. A set of 1000 scatter patterns 
representing ten different bacterial strains 
was used. Zernike and Chebyshev moments 
as well as Haralick texture features were 
computed from the available light-scatter 
patterns. The most promising features were 
first selected using Fisher's discriminant 
analysis, and subsequently a support- 
vector-machine classifier with a linear 
kernel was used. The use of distributed 
computing for scatter-pattern analysis, 
feature extraction, and selection provides a 
feasible mechanism for large-scale 
deployment of a light  scatter-based 


approach to bacterial classification [12]. 


D. STUDY ON METHOD OF 
BACTERIA IMAGE RECOGNITION 


They brought up the method to identify 
bacteria automatically. They designed a 
system as a hardware that contained a 
microscope, a digital camera, electric 
system and three step motors. The stage of 
the microscope was reconstructed and was 
connected with three step motors out of 
which two of them were plane motors and 
one remaining motor was for up and down 
for this to move three dimensionally in 
order to see the different regions of the 
slide. The digital camera was attached at the 
microscope eye piece for it to visualize the 
image or for capturing the image. The 
electric system performs three functions: 
communicating with PC _ software, 
reporting current status of the 
workstation, driving the step motors, 
recording the current position information. 
The slide was stained and diluted at first for 
the bacteria to be visualize at the 
microscope, the camera at ocular of the 
microscope captured the image. They 
further analysed the image by its colour and 
after filtering the image with two filters 
where threshold changes were made. The 
bacteria were identified by the filter. After 
that a new method connected to domain 
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was applied in which white pixels were 
labelled and shape was analysed if the 
shape is right it was labelled as bacterium . 


E. MOTIVATION 


In Pakistan, the existing water availability 
is about 79%. Inadequate and inadequate 
drinking water quality is a significant 
public health concern. The introduction 
into water bodies of radioactive products 
from urban and industrial environments, 
without any care, damages the consistency 
of the water and creates harmful effects for 
people. Water and sanitation agencies in 
Pakistan focused on water quantity, rather 
than water quality, because of the growing 
requirements. This is all due to insufficient 


knowledge, technology for treatments, 
facilities, qualified staff, and quality 
monitoring [11]. 

Different viruses, microbes, diverse 


minerals, and organic compounds present 
in healthy drinking water are adversely 
affected human health. A substantial 
number of people in developed countries 
are impacted by health conditions because 
of contaminated drinking water and 
microbial pollution. In developed 
countries, the polluted drinking water 
source has caused nearly five million 
children to die. This condition is growing 
day by day because of the exponential 
population increase, which leads inevitably 
to inadequate water quality management. In 
Pakistan, 30% of diseases and 40% of all 
deaths were believed to be attributable to 
the low quality of water. The most common 
cause of death in babies and children in 
Pakistan is diarrhea, which is a waterborne 
disease, and one in five people in Pakistan 
suffer from disease and disease caused by 


contaminated water [19]. 


Water quality decreases rapidly in most 
cities in Pakistan. Groundwater availability 
is the main source of declining water 
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quality. Every year in Pakistan there are 
about 100 million diarrheal cases. 
Infections of cholera, typhoidal diseases, 
dysentery, hepatitis, giardia, 
cryptosporidiosis, and guinea worm (which 
include diseases related to sanitation 
problems) account for nearly 80% of all 
illnesses and account for 33% of deaths 
[13]. 

IV. CONCLUSION 


Machine learning is being rapidly 
incorporated into almost every research 
area. The growing number of ML 
applications and use cases across a broad 
range of domains is a testament to this 
technology's potential. applications may no 
longer be quantifiable. The results show 
that ML is frequently utilized for 
microbiological research and focuses on 
difficulties of categorization and 
interaction analysis. Many difficulties 
remain unanswered, however, and 
researchers from many sectors, such as 
biology, information technology and 
medicine, will have to work together to 
support the development and improvement 
of microbiological research. The optimal 
technique may be employed by the 
microbiologist in his everyday practice 
according to completed experiments. In 
addition, when the data base size triples, 
this approach may also be utilized, as the 
accuracy of a method diminishes linearly 
with the database size. 


This review documented the research 
performed in Pakistan on the quality and 
contamination of potable water as the 
predominating and main contaminant 
owing to poor sanitary and sewage system 
sewerage water (fecal) mixture with 
drinking water. Chemical pollution from 
industrial waste toxics, textile dyes, 
pesticides and nitrogen fertilizers, arsenic 
and other chemicals is the second cause of 


the contamination. Regular inspections of 
existing treatment plants must be 
maintained and upgraded. Today, 
Pakistan's government will set up drinking 
water filters throughout Pakistan. The 
results showed that sewage pollution with 
drinking water should be regarded as a 
major health and environmental problem 
[19]. 


A, FUTURE WORK 


ML has a promising future in microbiology, 
despite these obstacles. As algorithms 
develop and trustworthy, they will become 
part of processes that will become available 
by researchers (for instance in Galaxy or 
Bioconductor). ML-algorithms of state-of- 
the-art, particularly neural networks for 
deeper learning, can handle with picture, 
text (for example, Pub-Med papers or 
genome / experimental annotations) and 
strings (such as genomic sequences). The 
Internet databases are packed with such 
data that were before too difficult to use. 
There is a wonderful chance to mix data 
from different sources and solve previously 
uncomfortable challenges. A 
microbiologist has, in our view, two 
options: taking up the ML revolution and 
beginning to become a competent ML 
practitioner or accepting that ML is 
something esoteric that mathematicians and 
IT scientists are better able to do. 


The next study will focus on the extension 
of the DIBaS data set, and the extension of 
the technique examined by providing 
colour distribution information. The 
accuracy of recognition should be further 
improved. 
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